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Novel compounds 

The present invention relates to novel bacterial genes and processes for 
improving the manufacture of clavams e.g. clavulanic acid. The present invention 
also provides novel organisms capable of producing increased amounts of clavulanic 
5 acid. 

Microorganisms, in particular Streptomyces sp. produce a number of 
antibiotics including clavulanic acid and other clavams, cephalosporins, polyketides, 
cephamycins, tunicamycin, holomycin and penicillins. There is considerable interest 
in being able to manipulate the absolute and relative amounts of these antibiotics 
0 10 produced by the microorganism and accordingly there have been a large number of 
studies investigating the metabolic and genetic mechanisms of the biosynthetic 
^* pathways [Domain, A.L. (1990) "Biosynthesis and regulation of beta-lactam 

W antibiotics." In : 50 years of Penicillin applications, history and trends]. Many of the 



a 



enzymes which carry out the various steps in the metabolic pathways and the genes 



*| 15 which code for these enzymes are known. 



Clavams can be arbitrarily divided into two groups dependent on their ring 
stereochemistry (5S and 5R clavams). The biochemical pathways for the biosynthesis 
of 5R and 5S clavams have not yet been fully elucidated but it has been suggested 
that they are derived from the same starter units (an as yet unidentified 3 carbon 

20 compound [Townsend,C.A. and Ho, M.F. (1985) J. Am. Chem. Soc. 107 (4), 1066- 
1068 and Elson, S.W. and Oliver, R.S. (1978) J. Antibiotics XXXI No.6, 568] and 
arginine [ Valentine, B.P. et al (1993) J. Am Chem. Soc. 15, 1210-1211] and share 
some common intermediates [Iwata-Reuyl, D. and C.A.Townsend (1992) J.Am. 
Chem. Soc. 114: 2762-63, and Jane, J.W. et al (1993) Bioorg. Med. Chem. Lett. 

25 3:2313-16]. 

Examples of 5S clavams include clavam-2-carboxylate (C2C), 2- 
hydroxymethylclavam (2HMC), 2-(3-alanyl)clavam, valclavam and clavaminic acid 
[GB 1585661 , Rohl, F. et al. Arch. Microbiol. 147:315-320, US 4,202,819 ] There 
are, however, few examples of 5R clavams and by far the most well known is the beta 
30 lactamase inhibitor clavulanic acid which is produced by the fermentation of 
Streptomyces clavuligerus. Clavulanic acid, in the form of potassium clavulanate is 
combined with the beta-lactam amoxycillin in the antibiotic AUGMENTIN (Trade 



rl 

Mark SmithKIine Beecham). Because of this commercial interest, investigations into 
the understanding of clavam biosynthesis have concentrated on the biosynthesis of 
the 5R clavam, ciavulanic acid, by S. clavuligerus. A number of enzymes and their 
genes associated with the biosynthesis of ciavulanic acid have been identified and 
5 published. Examples of such publications include Hodgson, J.E. el al. Gene 166, 49- 
55 (1995), Aidoo, K.A. et al, Gene 147, 41-46 (1994), Paradkar, A.S. et al, J. Bact. 
177(5), 1307-14 (1995). In contrast nothing is known about the biosynthesis and 
genetics of 5S clavams other than clavaminic acid which is a ciavulanic acid precursor 
produced by the action of clavaminic acid synthase in the ciavulanic acid biosynthetic 

10 pathway in S. clavuligerus . 

Gene cloning experiments have identified that S. clavuligerus contains two 
clavaminic acid synthase isoenzymes, casl and cas2 [Marsh, E.N. et al Biochemistry 
31, 12648-657, (1992)] both of which can contribute to ciavulanic acid production 
under certain nutritional conditions [Paradkar, A.S. et al., J. Bact. 177(5), 1307-14 

15 (1995)]. Clavaminic acid synthase activity has also been detected in other ciavulanic 
acid producing micro-organsims, ie. S. jumonjinensis [Vidal, CM., ES 550549, 
(1987)] and S .katsurahamanus [Kitano, K. et al, JP 53-104796, (1978)] as well as 
S. antibioticos, a producer of the 5S clavam, valclavam [Baldwin, J.E. et al, 
Tetrahedron Letts. 35(17), 2783-86, (1994)] . The latter paper also reported S. 

20 antibioticos to have proclavaminic acid amidino hydrolase activity, another enzyme 
known to be involved in ciavulanic acid biosynthesis. All other genes identified in 
S. clavuligerus as involved in clavam biosynthesis have been reported to be required 
for ciavulanic acid biosynthesis [Hodgson, J.E. et al, Gene 166, 49-55 (1995), 
Aidoo, K.A. et al, Gene 147, 41-46 (1994)] and as yet none have been reported 

25 which are specific for the biosynthesis of 5S clavams. 

We have now identified certain genes which are specific for the biosynthesis 
of 5S clavams as exemplified by C2C and 2HMC in S. clavuligerus. Accordingly the 
present invention provides DNA comprising one or more genes which are specific 
for 5S clavam biosynthesis in S. clavuligerus and which are not essential for 5R 

30 clavam (e.g. ciavulanic acid ) biosynthesis. 

By "gene" as used herein we also include any regulatory region required for 
gene function or expression. In a preferred aspect the DNA is as identified as Figure 
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1. Preferably the DNA comprises the nucleotide sequences indicated in Figure 1 
designated as orfup3, orfup2, orfupl, orfdwnl, orfdwn2 and orfdwn3. The present 
invention also provides proteins coded by said. DNA. The present invention also 
provides vectors comprising the DNA of the invention and hosts containing such 
5 vectors. 

Surprisingly we have found that when at least one of the genes according to 
the invention is defective the amount of clavulanic acid produced by the organism is 
increased. Accordingly the present invention also provides processes for increasing 
the amount of clavulanic acid produced by a suitable microorganism. In one aspect 

10 of the invention the genes identified can be manipulated to produce an organism 
capable of producing increased amounts of clavam, suitably clavulanic acid. The 
findings of the present work also allow an improved process for the identification of 
organisms with higher clavulanic acid production comprising a preliminary screening 
for organisms with low or no 5S clavam production ( for example by hplc and/or 

15 clavam bioassay as described in the examples herein). 

Suitably the 5S clavam genes of the present invention can be obtained by 
conventional cloning methods (such as PCR) based on the sequences provided 
herein. The function of the gene can be interfered with or eliminated/deleted by 
genetic techniques such as gene disruption [Aidoo, K.A. et al., (1994), Gene, 147, 

20 41-46]., random mutagenesis, site directed mutagenesis and antisense RNA. 

In a further aspect of the invention there are provided plasmids containing one 
or more defective genes, preferably the plasmids pCEC060, pCEC061, pCEC056 
and pCEC057, described below. 

Suitably, the plasmids of the invention are used to transform an organism such 

25 as 5. clavuligerus, e.g. strain ATCC 27064 (which corresponds to S. clavuligerus 
NRRL 3585). Suitable transformation methods can be found in relevant sources 
including : Sambrook, J., Fritsch, E.F. and Maniatis, T. (1989), Molecular cloning: a 
laboratory manual, 2nd Ed., ColdSpring Harbor Laboratory, Cold Spring Harbor, 
N.Y ; Hopwood, D.A. et al. (1985), Genetic Manipulation of Streptomyces. A 

30 Cloning Manual, and Paradkar, A.S. and Jensen, S.E. (1995), J. Bacterid. 177 (5): 
1307-1314. 
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Strains of the species S. clavuligerus are used industrially to produce 
clavulanic acid (potassium clavulanate). Within the British and United States 
Pharmacopoeias for potassium clavulanate (British Pharmacopoeia 1993, Addendum 
1994, p 1362-3 and U.S. Pharmacopeia Official Monographs 1995, USP 23 NF18 
5 p384-5) the amounts of the toxic 5S clavam, clavam-2-carboxylate, are specifically 
controlled. 

Therefore in a further aspect of the invention there is provided an organism 
capable of producing high amounts of clavulanic acid but has been made unable to 
make C2C or capable of producing high amounts of clavulanic acid but able to make 

10 only low levels of C2C. Suitably the clavulanic acid producing organism contains one 
or more defective clavam genes, and is preferably the S. clavuligerus strain 56- 1A, 
56-3A, 57-2B, 57-1C, 60-1 A, 60-2A, 60-3A, 61-1A, 61-2A, 61-3A, and 61-4A, 
described below. Such organisms are suitable for the production of clavulanic acid 
without the production of the 5S clavam, clavam-2-carboxylate or with significantly 

1 5 reduced production of clavam-2-carboxylate. 

EXAMPLES 

In the examples all methods are as in Sambrook, J., Fritsch, E.F. and Maniatis, T. 
(1989) Molecular Cloning A Laboratory Manual (2nd Edition), or Hopwood, D.A. et 
20 al. (1985) Genetic Manipulation of Streptomyces. A Cloning Manual, and Paradkar, 
A.S. and Jensen, S.E. (1995) J. Bacterid. 177 (5): 1307-1314 unless otherwise 
stated. 

I. DNA sequencing of the Streptomyces clavuligerus chromosome upstream 
25 and downstream of the clavaminate synthase gene casl . 

A. Isolation of casl. 

To isolate chromosomal DNA fragments from Streptomyces clavuligerus 
NRRL 3585 encoding the gene for clavaminate synthase isozyme 1 (casl) an 
oligonucleotide probe RMOl was synthesised based on nucleotides 9-44 of the 
30 previously sequenced casl gene (Marsh, E.N., Chang, M.D.T. and Townsend, C.A. 
(1992) Biochemistry 31: 12648-12657). Oligonucleotides were constructed using 
standard methods on an Applied Biosystems 39 1 DNA Synthesiser. The sequence of 
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RMOl, a 36-mer, was synthesised in the antiparallel sense to that published by Marsh 
et al (1992, ibid) RMOl was radiolabeled with 32p us ing standard techniques for 
end-labelling DNA oligonucleotides (Sambrook et al., 1989 ibid), and was used to 
screen a cosmid bank of Streptomyces clavuligerus genomic DNA by Southern 
5 hybridization as described by Stahl and Amann (In : Nucleic acid techniques in 
bacterial systematics. Ed. E. Stackebrandt and M. Goodfellow. Toronto : John Wiley 
and Sons, p. 205-248, 1991). The genomic bank of S. clavuligerus DNA, prepared 
in cosmid pLAFR3, was as described by Doran, J.L et al, (1990), J. Bacteriol. 172 
(9), 4909-4918. 

P| 10 Colony blots of the S. clavuligerus cosmid bank were incubated overnight 

W with radiolabeled RMOl at 60°C in a solution consisting of 5 x SSC, 5 x Denhardt's 

S| 

U solution, and 0.5% SDS (1 x SDS: 0.15 M NaCl + 0.015 M Na3citrate; 1 x 

jj{ Denhardt's solution: 0.02% BSA, 0.02% Ficoll, and 0.02% PVP). The blots were 

$ then washed at 68°C for 30 minutes in a solution of 0.5 x SSC + 0.1% SDS. One 

O 15 cosmid clone, 10D7, was isolated that hybridised strongly to RMOl and gave 

ru 

Q hybridization signals upon digestion with restriction endonucleases Sacl and EcoRI 

that were consistent with hybridization signals detected in similar experiments with 
ill digests of S. clavuligerus genomic DNA. 

20 B. DNA sequencing of the S. clavuligerus chromosome flanking casl. 

A partial restriction map of cosmid 10D7 was generated using restriction 
endonucleases Sacl, Ncol, and Kpnl. Southern hybridization experiments between 
RMOl and various digests of 10D7 DNA indicated that casl was most likely located 
at one end of a 7-kb Sacl-Sacl DNA subfragment. This fragment consisted of the 

25 casl open reading frame and approximately 6 kb of upstream DNA. The 7-kb 
fragment was then subcloned from a Sacl digest of 10D7 in the phagemid vector 
pBluescriptll SK+ (2.96 kb; Stratagene), thus generating the recombinant plasmid 
pCEC007. 

To facilitate the process of sequencing the chromosome upstream of casl, a 
30 3-kb Ncol-Ncol subfragment of the 7-kb Sacl-Sacl fragment was subcloned in 
pUC120 (3.2 kb; Vieirra and Messing, Methods Enzymol. 153, 3-1 1, 1987)) in both 
orientations, generating the recombinant plasmids pCEC026 and pCEC027. The 3- 
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kb subfragment consisted of the arruno-terminal-encoding portion of casl and 
approximately 2.6 kb of upstream DNA. 

Nested, overlapping deletions were created in both pCEC026 and pCEC027 
using exonuclease III and SI nuclease digestion (Sambrook et al., 1989 ibid) and the 
5 DNA sequence of the 3-kb Ncol-Ncol fragment was determined on both strands by 
the dideoxy chain termination method (Sanger, F., Nicklen, S. and Coulson, A.R. 
(1977), Proc. Natl. Acad. Sci. U.S.A. 74: 5463-5467) using a Taq dye-deoxy a 
terminator kit and an Applied Biosystems 373 A Sequencer. 

To determine the DNA sequence of the chromosome immediately 
10 downstream of casl a 4. 3-kb KpriL-EcoRl DNA fragment was subcloned from 
C5 cosmid clone 10D7 in pBluescriptll SK+, generating pCEC018. From pCEC018 a 

il 3.7-kb Sacl-Sacl subfragment was cloned in pSLl 180 (3.422 kb, Pharmacia); one of 

the Sad termini of this fragment partially overlapped the TGA stop codon of casl, 
Wl the other was vector encoded. Both orientations of the 3.7-kb fragment were 

P 15 obtained during subcloning and the resulting recombinant plasmids were designated 
J*;- pCEC023 and pCEC024. Nested, overlapping deletions were created in both 

plasmids and the DNA sequence of the 3.7-kb fragment was determined on both 
f|| strands. The nucleotide sequence of the S. clavuligerus chromosome generated in 

these experiments, including and flanking casl sequence is shown in Fig.l. 

20 

II. Functional analysis of the open reading frames flanking casl. 

Computer analysis of the DNA sequence upstream of casl predicted the 
presence of two complete orfs and one incomplete orf . All three orfs were located 
on the opposite DNA strand to casl and were thus oriented in the opposite direction. 

25 The first open reading frame, orfupl, was located 579 bp upstream of casl and 
encoded a polypeptide of 344 amino acids (aa). The second open reading frame, 
orfupl, was located at 437 bp beyond the 3'-end of orfupl and encoded a 151 aa 
polypeptide. Beyond orfup2 is orfup3. The start codon of orfup3 overlaps the 
translationai stop codon of orfupl, suggesting that the two orfs are translationally 

30 coupled. No translationai stop codon for orfup3 was located on the 3-kb Ncol-Ncol 
fragment. 
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A similar analysis of the DNA sequence downstream of casl predicted the 

presence of two complete orfs and one incomplete orf . Two of the orfs were located 

on the opposite DNA strand to casl and were thus oriented towards casl. The third 

orf was located on the same strand as casl and was thus oriented away from it. The 

5 first downstream open reading frame, orfdwnl, was located 373 bp downstream of 

casl and encoded a 328 aa polypeptide. The second open reading frame, orfdwnl, 

was located 55 bp upstream of orfdwnl and encoded a 394 aa polypeptide. At 315 

bp upstream of orfdwnl and on the opposite strand was orfdwn3. Because no stop 

codon was observed for orfdwnS on the 3.7-kb fragment, it encoded an incomplete 

p 10 polypeptide of 219 aa. 

13 
Sj 

1^ Gene Disruption of the orfup and orfdwn open reading frames 

To assess the possible roles of the open reading frames flanking casl in the 
li biosynthesis of clavulanic acid and the other clavams produced by S. clavuligerus, 

Q 15 insertional inactivation or deletion mutants were created by gene replacement. The 
% method used for gene disruption and replacement was essentially as described by 

Paradkar and Jensen ( 1 995 ibid). 

fii 

A. orfup I 

20 A 1.5-kb Ncol-Ncol fragment carrying the apramycin resistance gene (aprO, 

constructed as described in Paradkar and Jensen (1995 ibid), was treated with 
Klenow fragment to generate blunted termini (Sambrook et al., 1989 ibid) and was 
ligated to pCEC026 that had been digested with Bsa&l and likewise treated with 
Klenow fragment. pCEC026 possesses a BsaBl site located within orfupl at 636 bp 

25 from the translational start codon. The ligation mixture was used to transform 
competent cells of E. coli GM 2163 (available from New England Biolabs, USA., 
Marinus, M.G. et al M G G (1983) vol 122, p288-9 ) to apramycin resistance. From 
the resulting transformants two clones containing plasmids pCEC054 and pCEC055 
were isolated; by restriction analysis pCEC054 was found to possess the apr r - 

30 fragment inserted in the same orientation as orfupl, while pCEC055 possessed it in 
the opposite orientation. 
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To introduce pCEC054 into S. clavuligerus, plasmid DNA was digested with 
BamHl and Hindlll and ligated to the high-copy number Streptomyces vector pIJ486 
(6.2 kb; Ward et al, (1986) Mol. Gen. Genet. 203: 468-478). The ligation mixture 
was then used to transform E. coli GM2163 competent cells to apramycin resistance. 
5 From the resulting transformants one clone, possessing the shuttle plasmid pCEC061, 
was isolated. This plasmid was then used to transform S. clavuligerus NRRL 3585. 
The resulting transformants were put through two successive rounds of sporulation 
on non-selective media and then replica plated to antibiotic containing media to 
identify apramycin-resistant and thiostrepton-sensitive transformants. From this 
10 process four putative mutants (61-1A, -2A.-3A and -4A) were chosen for further 
analysis. 

To confirm that these putative mutants were disrupted in orfupl genomic 
DNA was prepared from isolates 61-1A and 61-2A, digested with SacI and subjected 
to Southern blot analysis. The results of the Southern blot were consistent with a 

15 double cross-over having occurred and demonstrated that these mutants are true 
disruption replacement mutants in orfupl. 

The mutants 61-1 A, -2 A, -3 A and -4 A were grown in Soya-Flour medium and 
their culture supernatants were assayed by HPLC for clavulanic acid and clavam 
production. The composition of the Soya-Flour medium and the method for assaying 

20 clavams by HPLC were as previously reported (Paradkar and Jensen, 1995 ibid) 
except that the running buffer for the HPLC assay consisted of 0. 1 M NaH2P04 + 
6% methanol, pH 3.68 (adjusted with glacial acetic acid). The HPLC analysis 
indicated that none of the mutants produced detectable levels of clavam-2- 
carboxylate or 2-hydroxymethylclavam. Furthermore, when culture supernatants 

25 were bioassayed against Bacillus sp. ATCC 27860, using the method of Pruess and 
Kellett (1983, J. Antibiot. 36: 208-212)., none of the mutants produced detectable 
levels of alanylclavam. In contrast, HPLC assays of the culture supernatants showed 
that the mutants appeared to produce superior levels of clavulanic acid when 
compared to the wild-type ( Table 1 ). 

30 

Table 1 

Clavulanic acid titre ( CA) of orfupl mutants in shake flask tests 
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STRAIN 


70 HOURS 


70 HOURS 


93 HOURS 


93 HOURS 




CA ug /ml 


CA ug/mg DNA 


CA ug /ml 


CA ug/mg DNA 


NRRL 3585 #1 


87 


915 


166 


1963 


NRRL 3585 #2 


66 


790 


159 


1842 


61-1A 


272 


2894 


439 


6113 


6 1-2 A 


199 


2148 


225 


2928 


6 1-3 A 


54 


692 


221 


2585 


61 -4 A 


0 


0 


226 


2422 



B. orfdwnl and orfdwn2 

A deletion/replacement mutant in orfdwnl and orfdwnl was created by first 
5 digesting pCEC018 (7.3 kb) with Ncol and liberating a 1-kb subfragment containing 
most of orfdwnl and a portion of orfdwnl . The digest was fractionated by agarose- 
gel electrophoresis and the 6.3-kb fragment was excised and eluted from the gel. 
This fragment was then ligated to an Ncol-Ncol DNA fragment carrying apr r and 
used to transform E. coli XL 1 -Blue to apramycin resistance. One clone was obtained 

10 from this experiment but restriction analysis of the resulting recombinant plasmid 
revealed that two copies of the apramycin resistance fragment had been ligated into 
the deletion plasmid. To eliminate the extra copy of the a/?r r -fragment, the plasmid 
was digested with Ncol and self-ligated. The ligation mixture was used to transform 
E. coli GM2163 to apramycin resistance. From the transformants, two clones were 

15 isolated that contained plasmids pCEC052 and pCEC053 both of which possessed 
only one copy of the apr r -fragment; pCEC052 possessed the apr r -fragment inversely 
oriented with respect to orfdwnl and 2, while pCEC053 possessed the apr r -fragment 
inserted in the same orientation as orfdwnl and 2. 

A shuttle plasmid of pCEC052 was constructed by ligating BamHI-digested 

20 pCEC052 with similarly digested pIJ486 and transforming E. coli GM2163 to 
apramycin resistance. From this experiment one clone was isolated that contained the 
shuttle plasmid pCEC060. This plasmid was used to transform wild-type 5. 
clavuligerus 3585 to apramycin and thiostrepton resistance. The resulting 
transformants were put through two rounds of sporulation under non-selective 
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conditions and then replica plated to antibiotic containing media to identify apramycin 
resistant, thiostrepton sensitive colonies. Three putative mutants (60- J A, -2 A and - 
3 A) were chosen for further analysis. 

To establish the identity of these putative mutants genomic DNA was isolated 
5 from strains 60-1 A and 60-2A and digested with either SacI or BstEII and subjected 
to southern blot analysis.. The hybridisation bands generated from this experiment 
were consistent with both strains having undergone a double cross-over event 
demonstrating that these mutants are true disruption replacement mutants in 
orfdwnl/2. 

10 When these were cultured in Soya-Flour medium and their culture 

supernatants assayed by HPLC, none of the mutants produced detectable levels of 
clavam-2-carboxylate or 2-hydroxymethylclavam. A bioassay of the culture 
supernatants showed that the mutants also failed to produce detectable levels of 
alanylclavam. As with the orfupl mutants, the orfdwnl/2 mutants are capable of 

15 producing superior to wild-type levels of clavulanic acid (Table2). 

Table 2 



Clavulanic acid titre (CA) of orfdwnl/2 mutants in shake flask tests 



STRAIN 


70 HOURS 


70 HOURS 


93 HOURS 


93 HOURS 




CA ug/ml 


CA ug/mg DNA 


CA ug/ml 


CA ug/mg DNA 


NRRL3585#1 


87 


915 


166 


1963 


NRRL3585#2 


66 


790 


159 


1842 


60-1A 


164 


1872 


260 


2911 


60-2A 


187 


2013 


108 


1320 


60-3A 


79 


994 


214 


2161 



20 orfdwn3 

To disrupt orfdwn3 pCEC023 (consisting of a 3.7-kb fragment of casl 
downstream DNA subcloned into pSL1180) was digested with Ncol and then self 
ligated. After transforming E.coli with the ligation mixture a clone was isolated that 
possessed the plasmid pCEC031. This plasmid retained only the 1.9kb NcoI-EcoRI 
25 fragment encoding a portion of orfdwn2 and the incomplete orfdwn3. An 



- 10- 



P31731 



f 



examination of the DNA sequence revealed that pCEC03 1 possessed a unique BstEII 
site at 158bp from the translational start site of orfdwn3. Therefore, pCEC031 was 
digested with BstEII, treated with Klenow fragment to create blunt ends and then 
ligated to a blunted apramycin resistance cassette. The ligation mixture was used to 

5 transform E.coli GM2163 to apramycin resistance and ampicillin resistance. Two 
transformants were selected that contained respectively pCEC050 and pCECOSl. 
restriction analysis revealed that the apramycin resistance cassette was orientated in 
the same orientation as orfdwn3 in pCEC050 and in the opposite orientation in 
pCECOSl . Both of these plasmids were then digested with Hindlll and ligated to 

10 similarly digested pIJ486. The ligation mixtures were then used separately to 
transform E.coli GM2163 to apramycin and ampicillin resistance. The shuttle 
plasmids pCEC056 (pCEC050 + pIJ486) and pCEC057 (pCEC051+ pIJ486) were 
isolated from the resultant transformants. Both plasmids were then used to transform 
S.clavuligerus NRRL 3585. 

15 One transformant was selected from each transformant experiment and put 

through two successive rounds of sporulation on non-selective media and then replica 
plated to antibiotic containing media to identify apramycin-resistant and thiostrepton- 
sensitive transformants. From this process two putative mutants were isolated from 
the progeny of each primary transformant. (56-1 A and 56-3 A for pCEC056, and 57- 

20 1 C and 57-2B for pCEC057). 

To establish the identity of these putative mutants genomic DNA was isolated 
from these strains and digested with either SacI or Acc65I and subjected to Southern 
blot analysis. The hybridisation bands generated from this experiment were 
consistent with both strains having undergone a double cross-over event 

25 demonstrating that these mutants are true disruption replacement mutants in 
orfdwn3. 

When these strains were cultured in Soya-Flour medium and their culture 
supernatants assayed by HPLC, the mutants produced greatly reduced levels of 
clavam-2-carboxylate or 2-hydroxymethylclavam. A bioassay of the culture 
30 supernatants showed that the mutants also failed to produce detectable levels of 
alanylclavam. As with the orfupl and orfdwnl/2 mutants, the orfdwn3 mutants were 
capable of producing superior to wild-type levels of clavulanic acid (Table 3). 
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Table 3 

Clavulanic acid titre (CA) of orfdwn3 mutants in shake flask tests 





71 UflTTDC 


/I tiUUKo 


yj tiUUKb 


yj tiUUKo 




(~"A no /ml 
V--/T, Ug/inL 


CA 1 1 it An a DMA 


CA tin /ml 
\^r\ lig/filL 


CA ug/mg DNA 


NRRL 3585 #iA 


180 


1580 


193 


1790 


NRRL 3585 #1B 


179 


1640 


266 


2310 


56-1 A 


34 


110 


235 


2160 


56-3A 


225 


2140 


274 


2740 


57-1 C 


253 


2910 


277 


2920 


57-2B 


242 


2240 


193 


I860 



The application discloses the following nucleotide sequences: 



SEQ ID No. 1 
SEQ ID No. 2 
SEQ ID No. 3 
SEQ ID No. 4 
SEQ ID No. 5 
SEQ ID No. 6 
SEQ ID No. 7 



DNA sequence of Figure 1 
orfup3 sequence 
orfup2 sequence 
orfupl sequence 
orfdwnl sequence 
ofrdwn2 sequence 
orfdwn3 sequence 



20 



25 
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Figure 1: Nucleotide sequence of the S. clavuligerus chromosome 
25 including and flanking casl 



Ncol ..... 

30 l 

GGTACCGCCCGCCGCCGACGGGGCCTCGGAGCCGGCCTGGCCACTGGTCCTGGTGGGGCC 6 0 

MAPPPQGPAEAPGTVLVVG 

35 61 

ACCCTATC ACCGGGCGGTGGGCCGCGTCGTCTGAGGGCCTGTGCCTGGGCACCCACACGC 120 
TPYHGAVRRLLSGSVSGHTH 

<orfup3 

40 . 
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121 

GCCTTTCCGGGCCTCCGGCCCAGTGTCGGTGCCCATTGCGCGCCACAGGAACGGGCGC AT 180 
* LWPYRATDKGA 

Y 

ASLGPPRTM 



181 

TAGCCCCAGGTCTATCTGCTTCCGGGCCACCTGCTCCTTCAGGGCGTGGAGCATCTGGCA 240 
10 DPDLYVFARHVLFDRVEYV 



Q 

■Q 15 CGTGGTCGCGGGCCGCCGGGTGAGCCCCAGTGGGCGGGCGGTGCCGGGCAGGGCCACGAG 300 
SI CWRGAAWEPDGARWPGDRK 

H E 

W 

w 

7 20 3 01 

0 TGGCACCCACCACGGGAGGCGCCGCTCCTCAAGCCAGGGCCAGTCTTAGGTCAACTGCCT 3 60 
III GHTTGEAALLETGTLIWNV 

1 * 
1 - • 

361 

GGTGTCTACCACCCACTAGCTCGCCTACCACGGGGGCTCCAGCAGCTTCTCGGCCCGCTA 420 
WLHHTISRITGGLDDFLRA 



421 

gagcctgaacggggcccggtctggggtgaaccccttcttcttctggcgcaggagccgctt 480 
eskgralgwkpfffvadea 

35 f 



481 

CATC AGCTAGCGCCCCCACGGCAGCGACGGCTGCGGCGGCAACAGCTTGCGGAACTTCAT 540 
40 YDIAPTGDSGVGGNDFAKF 



<orfup2 



P31731 



f 



541 

GCGCCACTACTGGCGGAACGCGACGAGCAGGCAGTATGGCCGGCTACGGTGCCTGTACTT 600 
ATIVAKRQEDTMGASAVSM 



601 

TGCTGGAGGTCTCTAAGGCCCACCGACACGACCCCGACGCCTTCCCCACAGGGGGCGCTT 6 60 



10 661 

CCTGCCGCCTGCGGCGCCTGCGGCGCCGGCAGAGGGGCCGCCTGCCCAGGGTCGCAGGAC 720 



721 

1 5 CTCTCCCGAACCGCCGCCGAACTGCGGCACGACAGGGCGCCGAACGCCTTGCGCTTCATG 7 80 



781 

GCCGGTCGCATGCCCGCAACGTGGCCTGCACATGCGGCCAGCCCTGGGGAGCATGGGGGC 84 0 



841 

CTCGGCCGGCTGGGGCCGCCGAGGCCCCCATGCCTGCGCGGCCTGGCCGGGCTCGCTCGG 900 



25 



901 

CCTGCCCAGCCTGCCACGCGCACCAAGGCCACACAGCCTGTCGAGCCTGCCTGGCCTGCC 960 



30 961 

ACGCGCACCAAGGCCACACAGCCTGTCGAGCCTGCCCAGCCTGCCACGCGC ACC AAGGCC 1020 



1021 

3 5 GTGCGGCCTGCCCAGTC AACGGCT AGT ACCGCTCGTTACGGCCCCACATGGCGAGGGGCC 1080 
*NGIMALLAPTYREG 



1081 

40 TGTGGCCCACCCTCTAGCGCCGGC AGTGGAGGCGCTCCCTGGCCAGCAGGTCGGCCTAGC 1140 
SVPHS IAATVEALSRDDLRI 



1141 

45 TCCGCCGCCGCTCTAACAGGCGCTCTACCCGGCCCAAGCGCCACGGGCCCTAGCCCTGCT 1200 



P31731 



r 



SAAALNDALHAPNATGPI PV 



1201 

5 GCAGGAGCGGGGCCACCACGTCGGTCCGCTCGCGCTCGACACGGTCCCAGTCGGGGTCTG 12 60 
VDEGRHHLWALALQALTLGL 



1261 

10 GCAGGCGCTGGCCCGCGTCGGCCACGTCGTTGCTCGCCAACGCGCGCTCCCGGCCTCGCG 1320 
GDAVPRLRHLLSRNRALAPA 



1321 

15 ACTTGGCCCCGACCGGGGCCGCCTTCAGGAGCAGGGGGTCTAGCAGCCACCACGCCTACC 13 80 
SFRPQGRRFDEDGLDDTTRI 



1381 

20 ACGGCCACTCTTTTGGGGCAGGGTCTCCCCGCATTCGCTGCTAGGGCTAGGGGTCGAGGG 1440 
TGTLFGRGLPAYAVIGIGLE 



1441 

25 CCGTCTGCCCGTGGTGGAGCAGGAGCTAGGGCGCGCTGGTGTCCGAGGTGAGCGAGACGT 1500 
RCVPVVEDEIGRSWLSWESQ 



1501 

30 GGCGGCAGTGGCCCACGTGGCGCAGGCGGGCCGCGTCGCACCGGCGCCTCCCGAGCCTCT 15 60 
VATVPHVADARRLTAAS PES 



1561 

35 CTGGCTCGGACGCCTGGAACGGGAGCGCGTGGTCGAGCCGGTGGCGTGGGTGCCAGAGGA 162 0 
LGLRRVKGERVLEAVAGVTE 



1621 

40 GCTAGCCGTGGCGGCCCAGGCAGGTCACGACCATCATGTCCAGCTACGCCAGCCACGGCT 1680 
EI PVAPDTWHQYYLDIRDTG 



1681 

45 CTGCTGCGTCCCTGGC AAGCGTCCGGCGCGCCTGCATCCTGCCGAGCGGCGTGTTCGGGA 1740 



- 16- 



P3L731 

LRRLSRECAARVYSPEGCLG 



1741 

5 CCCTCCGCGGCAGCCTGCTCGCGTGGTACGGCTTGAACCACCGCTAGTCGTGGAGCAGGG 1800 
QSAGDSSRVMGFKTAILVED 



1801 

1 0 CCGCCGGGCGCTGGCGGGCAGGCTCGTCGAGGAGTGGCCGCGGCTCGGGGACCTGCAGCC 18 60 
RRGAVARGLLEEGAGLGQVD 



1861 

GCCACAGGTCGTCCCACTGGGGCCGCAGCTGCCGCCGCGCCTACCACCGGCAGCGGGCCC 1920 
ATDLLTVGADVAARITATAR 



1921 

GCGCCAGGCCCGCAGGCATCTTCAGCCACCAGCCGTCCGTCGGCTCGGGGACCCGTGACT 1980 
ARDPRGYFDTTPLCGLGQAS 



1981 

GGCCTTCC AGGGCGTCCCGCGCCTGGCCGCCTGCGCCTTGGCGCCGCCTGTGCCTTGGGC 2040 
VPLDRLARVPP RPVAASVSG 

<orfupl 

30 2041 

CCGGGGACTCGGGCGGAGAGCGGGACATACGGAACCTCCACAGGCGGAGCCGGGAACGGG 2100 

GGCCCCTGAGCCCGCCTCTCGCCCTGTATGCCTTGGAGGTGTCCGCCTCGGCCCTTGCCC 
APSEPPSRSM 
35 ..... 

2101 

ACGAGGGCGAGGACGGGACGGAACGAAGGAGAGGACGGGACGGAC AGC ACGGACGGGACG 2160 
40 TGCTCCCGCTCCTGCCCTGCCTTGCTTCCTCTCCTGCCCTGCCTGTCGTGCCTGCCCTGC 



f 



q 
■■- 

si 



m 

a 
m 
a 
.-. 



20 



45 



2161 

GACGGAACGGAGTCGGGAACCGGGGGGGGTGACCGGAACCGGGCCGTCCTTGGCCCTCCC 2220 



P31731 



i 



CTGCCTTGCCTCAGCCCTTGGCCCCCCCCACTGGCCTTGGCCCGGCAGGAACCGGGAGGG 



5 2221 

CCGTCCTCCCCGCCATCCGCCGTTCTCCCCCGTTCCCTCTCCCGTCCTCCAGCCAACACC 22 80 
GGCAGGAGGGGCGGTAGGCGGCAAGAGGGGGCAAGGGAGAGGGCAGGAGGTCGGTTGTGG 

10 

2281 

GCCGCCCTTTCCAAGCGCTTGACACGGCACCGACAGCCGCCGCCGGGCGCCCGATGGGGA 2340 
CGGCGGGAAAGGTTCGCGAACTGTGCCGTGGCTGTCGGCGGCGGCCCGCGGGCTACCCCT 

15 ..... 

2341 

CCCGTGCCCGCCGGTGAGCGGCGGTGAGCGCCGGTACGGGACCCCACGCGCCGCCGCCCG 2400 
20 GGGCACGGGCGGCCACTCGCCGCCACTCGCGGCCATGCCCTGGGGTGCGCGGCGGCGGGC 



2401 

GGCGCCCGCCAGGGCCCGCGCGGCCACCCCGGCCCGCCCCGGCCGGAGCGGCGATCCGGG 2 4 60 

25 

CCGCGGGCGGTCCCGGGCGCGCCGGTGGGGCCGGGCGGGGCCGGCCTCGCCGCTAGGCCC 



2461 

3 0 CCGCTCGCTGCAAGAGGAACATCCACAGCCGCACAAGGAGCGCTCCGCAC AGTGGGCACC 252 0 
GGCGAGCGACGTTCTCCTTGTAGGTGTCGGCGTGTTCCTCGCGAGGCGTGTCACCCGTGG 



35 2521 

ACGTCCGCCCCGTCCCCCACACCGTGGCCGGTCCCCACCGGACAGCACAGCACCGCACAG 2 580 
TGCAGGCGGGGCAGGGGGTGTGGCACCGGCCAGGGGTGGCCTGTCGTGTCGTGGCGTGTC 

40 . 

2581 

CACCAC ATCGCACGGCACAGCACAGCACCACCGGCACGAGGAACCAAGGAAAGGAACCAC 2 640 



GTGGTGTAGCGTGCCGTGTCGTGTCGTGGTGGCCGTGCTCCTTGGTTCCTTTCCTTGGTG 

45 



P31731 



i 



casl> 

MTSVDCTAYGPELRALAA 

2641 

ACCACCATGACCTCAGTGGACTGCACCGCGTACGGCCCCGAGCTGCGCGCGCTCGCCGCC . 2700 

5 

TGGTGGTACTGGAGTCACCTGACGTGGCGCATGCCGGGGCTCGACGCGCGCGAGCGGCGG 



2701 

10 CGGCTGCCCCGGACCCCCCGGGCCGACCTGTACGCCTTCCTGGACGCCGCGCACACAGCC 27 60 
RLPRTPRADLYAFLDAAHTA 



2761 

1 5 GCCGCCTCGCTCCCCGGCGCCCTCGCC ACCGCGCTGGAC ACCTTC AACGCCGAGGGC AGC 2820 
AASLPGALATALDTFNAEGS 



2821 

20 GAGGACGGCCATCTGCTGCTGCGCGGCCTCCCGGTGGAGGCCGACGCCGACCTCCCCACC 2880 
EDGHLLLRGLPVEADADLPT 

NcoT . 

25 2881 

ACCCCGAGCAGCACCCCGGCGCCCGAGGACCGCTCCCTGCTGACCATGGAGGCCATGCTC 2940 
TPSSTPAPEDRSLLTMEAML 

KpnI. 

30 . 

2941 

GGACTGGTGGGCCGCCGGCTCGGTCTGCACACGGGGTACCGGGAGCTGCGCTCGGGCACG 3000 
GLVGRRLGLHTGYRELRSGT 

35 . 

3001 

GTCTACCACGACGTGTACCCGTCGCCCGGCGCGCACCACCTGTCCTCGGAGACCTCCGAG 30 60 
VYHDVYPSPGAHHLSSETSE 

40 

3061 

ACGCTGCTGGAGTTCCACACGGAGATGGCCTACCACCGGCTCCAGCCGAACTACGTCATG 3120 
TLLEFHTEMAYHRLQ PNYVM 



45 



P31731 



3121 

CTGGCCTGCTCCCGGGCCGACCACGAGCGCACGGCGGCCACACTCGTCGCCTCGGTCCGC 3180 
LAC SRADHERTAATLVASVR 

5 

3181 

AAGGCGCTGCCCCTGCTGGACGAGAGGACCCGGGCCCGGCTCCTCGACCGGAGGATGCCC 3240 
KALPLLDERTRARLLDRRMP 



3241 

TGCTGCGTGGATGTGGCCTTCCGCGGCGGGGTGGACGACCCGGGCGCCATCGCCCAGGTC 3300 
CCVDVAFRGGVDDPGAIAQV 



3301 

AAACCGCTCTACGGGGACGCGGACGATCCCTTCCTCGGGTACGACCGCGAGCTGCTGGCG 33 60 
KPLYGDADDPFLGYDRELLA 



3361 

CCGGAGGACCCCGCGGACAAGGAGGCCGTCGCCGCCCTGTCCAAGGCGCTCGACGAGGTC 3420 
PEDPADKEAVAALSKALDEV 



3421 

ACGGAGGCGGTGTATCTGGAGCCCGGCGATCTGCTGATCGTCGACAACTTCCGCACCACG 3480 
TEAVYLEPGDLL IVDNFRTT 



3481 

CACGCGCGGACGCCGTTCTCGCCCCGCTGGGACGGGAAGGACCGCTGGCTGC ACCGCGTC 3540 
HARTPFSPRWDGKDRWLHRV 

35 . 

3541 

TACATCCGCACCGACCGCAATGGACAGCTCTCCGGCGGCGAGCGCGCGGGCGACGTCGTC 3600 
YIRTDRNGQLSGGERAGDVV 

40 AFTPRG* Sad 

3601 

GCCTTCACACCGCGCGGCTGAGCTCCCGGGTCCGACACCGCGCGGCTGAACCCACGGTCC 3 6 60 
45 CGGAAGTGTGGCGCGCCGACTCGAGGGCCCAGGCTGTGGCGCGCCGACTTGGGTGCCAGG 



O 

H 

w 
y 

CO 

b 
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3661 

GGGGCCCACGGTCCGGCACCGCGCGGCTGAGCCCCCGGGTCCGGCAGCGGGCGGCTGAAC 3720 

5 

CCCCGGGTGCCAGGCCGTGGCGCGCCGACTCGGGGGCCCAGGCCGTCGCCCGCCGACTTG 



3721 

1 0 CCCCGCCCCGGGCCACCGCCCGACCGCCCCCGCGC ACCGGACGCGCCCGCCTGTACGGCG 3780 
GGGGCGGGGCCCGGTGGCGGGCTGGCGGGGGCGCGTGGCCTGCGCGGGCGGACATGCCGC 

15 3781 

GTCCCGCCCGGGCCCGTACACCTGAAGCGCCCGGCGGACCGCCGCCCCGCCGGGGGACGG 3840 
CAGGGCGGGCCCGGGCATGTGGACTTCGCGGGCCGCCTGGCGGCGGGGCGGCCCCCTGCC 

20 > < 

3841 

ACAGAGCCGGGTGCGGGAGGACGTCCTCCCGCACCCGGCTCCCACCGTTCCGCACCGACC 3900 
25 TGTCTCGGCCCACGCCCTCCTGCAGGAGGGCGTGGGCCGAGGGTGGCAAGGCGTGGCTGG 



3901 

GCACCCGACCGTGCCGCAGGCGCCACCGGCACCGCACCGCCCGCGCCGGCAGCCACCACA 3960 

30 

CGTGGGCTGGCACGGCGTCCGCGGTGGCCGTGGCGTGGCGGGCGCGGCCGTCGGTGGTGT 



3961 

35 GGCGCCACGCCGCCCGCACGGTGCCCGCGCTGCTCAGCCCCCGTCCACCGGGCTGTCCAG 4020 

CCGCGGTGCGGCGGGCGTGCCACGGGCGCGACGAGTCGGGGGCAGGTGGCCCGACAGGTC 

*GGDVPSD 

L 

40 ..... 

4021 

GTCGGCGGCGTCGCGCGGGGGCTACTTGAGGGCCAGCCGCCGGCTGGGGGGCCTGGGGCG 4080 
LRRLAGGI FERDAASGGSG 

45 A 
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4081 

CTCTACGGGGGTGTGAGGGCCCTAGTGGAGGTCGCTCCGTATGCCGTCGTCTAGCCGGTG 4140 
5 LHGWVGPIVELSAYPLLDA 
V 



4141 

GGCGAAGAGCAGGAGCTGCCGCTTTGTGTGC AGGTCCCGCGGGCCGTCGTGGTGCCGGGC 4200 
RKEDEVAFCVDLAGPLVVA 

R 



4201 

GCGGCACTGCCTCCGGTCGCGGCGGAGCTGCGAGGGGGGCCGGGGCCCACAGCGGGGGTG 42 60 
ATVSALAAEVSGGAGPTAG 

V 

Ncol . 

4261 

TAGGCACAAGAGGGTCCACGCGTGGTACCACTCGTCTAGGCGCCGCGGCCCGGGCCTCTC 4320 
DTNEWTRVMTLLDAAGPGS 

L 



4321 

CTTCTGGACGAGGGTCTTCGGCCACTCCATGAGGAGCGCCCACCGCTTTGGGTCGAGGGC 4380 
30 FVQEWFGTLYEERTAFGLE 
R 



4381 

35 CACCCGTGCCGCCCGGGTCTTCCTTGCGCTCCAGGGGGTGGGCCGCTTGTGGGCCGGGCG 4440 
HARRAWFSRSTGWGAFVRG 

A 



40 4441 

GCGGAAGGCGGGGGCGAGGGGCCGCAGCCGCGACTCGCGGCGCCGGTCTGGCCTGTCGTC 4500 
AKRGREGADASLAAALGSL 

L 

45 



M 

■ 



20 



r 
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4501 

CTGGTCCGACACGCCCGACGAGTGGCCGCGGGGCGTCTAGCCCCGCTAGGCCGCGTGGTA 456 0 
VLSHPSSVPAGCIPAIRRV 

M 

5 ..... 

4561 

GGGGCCTACGCTGTGCCGGGTGACCATCCGCACCCGGCGCGGGTAGCTGGTCGGGCACTG 462 0 
GPHSVAWQYAHAAGMSWGT 

10 v 



4621 

GTCCCGGTCAAGGGCATGGGGGTCGAGGAGCCACTCGTCGGCCACGACGCGGCGCTGTAA 4680 
15 LALERVGLEETLLRHQAAV 
N 



4681 

20 CAGGACGCCTCACTAGTCGCCTTTCGCCCTGGGGCTGCCCACCAACGGCCCGCTCGACCT 4740 
DQPTILPFRSGSPHNGPSS 

S 



25 4741 

CTGGGGCAACGGCTTCTCAGGCCGCCACTGCTGCGTCATGGCGGCCCACAGGTCGCCGTC 4 800 
VGNGFLGATVVCYRRTDLP 

L 

30 . 

4801 

GGGGCGTGGCTAGTCGGTCAGCATGGGCCACACCAGGGCCGGCTTCTTGCTGCCTGTCTC 48 60 
GAGILWDYGTHDRGFFSPC 

L 

35 ..... 

4861 

GTGGTGCAAGCAGGGCAGCCGCAAGCCGCACGGCATGTACCGCATTGGCTAGGCCCGC AG 492 0 
VVNTGDANPTGYMAYGIRA 

40 D 



<orfdwnl 
4921 

45 GGCGTCCTGGAGGGGC AGGTCGTTGCCGTCAAGCAGCTAGAGCTTATACGCCGTAAGGTG 4980 



-23- 



P31731 



RLVEGDLLPLEDIEFIRCE 



5 4981 

GCGACTGGAGGAACAAGCTAGGGGGGCCTGTTGTCCAGCCAGCACCGGCCTCTGAGTCTC 5040 



10 . 

5041 

GGTCAACCCCCGCTAGAGCCACCGGGTGTCGAGGTCCGACGCGTCGACCTGTAGCACGCC 5100 
WNPAIETAWLELSRLQVDH 

P 

15 ..... 

5101 

CTAGTCGGGCCTCATGACCGTGACCTCGTCTATGAGGCCTAGCACGGCGAGGTGGTCGAA 5160 
ILGSYQCQLLYEPDHREVL 

20 k 



5161 

GAGCTAGTACGCCAACTACAGCAGGCCCCACGGCTGGGTGAGGTCGGGGGCC AGCTGGTC 5220 
25 EIMRNIDDPTGVWELGRDV 



5221 

30 CC AGAAC ATCAGGCTCGGCTAGCCTGGGC AGAGCGGCCAGCGCGCGTCGCGGAGCCACTT 5280 
TKYDSGIPGTEGTARLAET 



NcoT ..... 

35 . 

5281 

CGGGTACCCCGGCTTGGTCAAGAGCTTCTACTTCGGCGGCGGCGCCCTGCGGGTCACCAC 5340 
GMPGFWNEFIFGGGRSAWH 

H 

40 ..... 

5341 

ccggagcggcctcagggccctctggtcctgcaggaagtagtggggctgggcgagcggggc 5400 
aegsdrsvlvdkmvgvreg 

45 r 



P31731 



5401 

GGCGTCCCACGGC ACCGGGCGGCGGAGCCGGAGGAGGGCCATCTACAGGTAGTCGGCCCG 5460 
5 RLTGHGAAEAEERYIDMLR 
A 



5461 

CTGCTAGACCAGC AGCCAC AAGTAGTCCTAGCCGTGGTGCGGGAGGGCCCGTGTCTTGGC 5520 
VIQDDTNMLIPVVGERACF 

R 



5521 

CTTGCACAGGAGTGACTTCGACTTGCCGACCTTCTGCCCGCCCACCCCCGCGACCATCCC 55 80 
FTDESFSFPQFVPPHPRQY 

P 



5581 

GAACCCGCGCTACGGGTGGAGCGCCTACTGCGGCAAGAGCAGCTCCGGGGCCGGCATCGC 5640 
KPAIGVERIVGNEDLGRGY 

R 



5641 

CGCGTGGCGGAGCATCCCCTTGAGGTCCAGGCCGTGGCCCTAGC AGGTGACGAGGGGCCT 5700 
RVAEYPFELDPVPITWQEG 

30 s 



5701 

CACCCACTTGCAGAGCCAGC AGGTGCGGAAGAACTACTAGAGGGTCACGAGGAGCTTCTC 57 60 
35 HTFTETTWAKKI IEWHEEF 

L 



5761 

40 CCGTGCTAACGCGGCCAGGGCGAGGGGCCGCAGCCTGTCCCACGGCGGCTGGGGCATGTG 582 0 
ARNRRDREGADSLTGGVGY 

V 
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( 



5821 

GACGGGGTACTACAGCCGGGTCGCGAAGACCTTGGGCGCGCGCTAGGGCTGCTTCCGCGC 5880 
QGMI DAWRKQFGRAIGVFA 

R 

5 ..... 

5881 

CGGGGCCCAGTACACCAGCTCGTAGCGGTCTAGGAGCCGGTCGGCGTCGCCTAACACGTC 5 940 
GRTMHDLMALDEALRLPNH 

10 L 



5941 

GCCGTCCTGCAACCGGTAGACCGGCTGGGCCTAC ACGGCCCAGACGTACGGCTCCATCTC 6000 
15 PLVNAMQGVRIHRTQMGLY 
L 



6001 

20 GGGGTCGTACTAGCCCAACAACCTCTGGAGCTTTGGGAGCCACACCTTCACCACGAGCCA 6060 
GLMI PNNSVEFGETHFHHE 

T 



25 6061 

CTTCCTGTCAGGGGTCATCGGCTCAAGCAGCCGGCGGACGCGGACGGCCCACTCGACGGC 6120 
FSLGWYGLEDAAQAQRTLQ 

R 

30 . 

6121 

CTCGTACAAGACCATCAAGACGCCTAACTGGGGGCGGTATGGGGCGACCTGGACGCGTAC 6180 
LMNQYNQPNVGAMGRQVQA 

H 

35 

<orfdwn2 

6181 

ACTGCCGACCGTTGGCAGATAGAAGAGAATGGACTTCACCCTGGCTCCTCCGGTTCGCGG 6240 

40 

TGACGGCTGGCAACCGTCTATCTTCTCTTACCTGAAGTGGGACCGAGGAGGCCAAGCGCC 
SGVTPLYFLISKM 
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6241 

CGCCCTCCATTGACGTGCGCCGAAAGCGGCTCGACCGTCCCACTCCGCCCTTGAGTTCCG 6300 
GCGGGAGGTAACTGCACGCGGCTTTCGCCGAGCTGGCAGGGTGAGGCGGGAACTCAAGGC 



6301 

TCTGACGCCGCGCCAGTCGGCGGGCCGTCCGCCGGGGTGCCCGCCGGGGTCCGCACCCGC 63 60 
1 0 AGACTGCGGCGCGGTCAGCCGCCCGGCAGGCGGCCCCACGGGCGGCCCCAGGCGTGGGCG 



6361 

CGGACGGC ACGGCGCGC ACCGCGCGCGCGGCGCTTCGGGGCACCGGGCTCGACGGGGTGC 642 0 
GCCTGCCGTGCCGCGCGTGGCGCGCGCGCCGCGAAGCCCCGTGGCCCGAGCTGCCCCACG 



6421 

20 TCAGCGGGACGTCCAACGGAAGGCAAGCCCCCGTACCCAGCCTGGTCAAGGCGCTCATCG 648 0 
AGTCGCCCTGCAGGTTGCCTTCCGTTCGGGGGCATGGGTCGGACCAGTTCCGCGAGTAGC 

orfdwn3> 

25 .....MP 

G 

64S1 

CCATTCCCTGAGGAGGTCCCGCCTTGACCACAGCAATCTCCGCGCTCCCGACCGTGCCCG 6540 
30 GGTAAGGGACTCCTCCAGGGCGGAACTGGTGTCGTTAGAGGCGCGAGGGCTGGCACGGGC 



6541 

GCTCCGGACTCGAAGCACTGGACCGTGCCACCCTCATCCACCCCACCCTCTCCGGAAACA 660 0 
SGLEALDRATLIHPTLSGN 



6601 

40 CCGCGGAACGGATCGTGCTGACCTCGGGGTCCGGCAGCCGGGTCCGCGACACCGACGGCC 6660 
AERIVLTSGSGSRVRDTDG 
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6661 

GGGAGTACCTGGACGCGAGCGCCGTCCTCGGGGTGACCCAGGTGGGCCACGGCCGGGCCG 6720 
EYLDASAVLGVTQVGHGRA 

E 

5 ..... 
6721 

AGCTGGCCCGGGTCGCGGCCGAGCAGATGGCCCGGCTGGAGTACTTCCACACCTGGGGGA 67 80 
LARVAAEQMARLEYFHTWG 

10 T 



6781 

CGATCAGCAACGACCGGGCGGTGGAGCTGGCGGCACGGCTGGTGGGGCTGAGCCCGGAGC 6840 
ISNDRAVELAARLVGLSPE 

P 



6841 

CGCTGACCCGCGTCTACTTCACCAGCGGCGGGGCCGAGGGCAACGAGATCGCCCTGCGGA 6900 
LTRVYFTSGGAEGNEIALR 

M 



6901 

TGGCCCGGCTCTACCACCACCGGCGCGGGGAGTCCGCCCGTACCTGGATACTCTCCCGCC 69 60 
ARLYHHRRGESARTWILSR 

R 



6961 

GGTCGGCCTACCACGGCGTCGGATACGGCAGCGGCGGCGTCACCGGCTTCCCCGCCTACC 7020 
SAYHGVGYGSGGVTGFPAY 

H 



7021 

ACCAGGGCTTCGGCCCCTCCCTCCCGGACGTCGACTTCCTGACCCCGCCGCAGCCCTACC 7080 
QGFGPSLPDVDFLT PPQPY 

40 R 



7081 

GCCGGGAGCTGTTCGCCGGTTCCGACGTCACCGACTTCTGCCTCGCCGAACTGCGCGAGA 7140 



M 

U 
I 

V 

n 

Q 



a 
m 



30 
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RELFAGSDVTDFCLAELRE 



Sail 

5 7141 CCATCGACCGGATCGGCCCGGAGCGGATCGCGGCGATGATCGGCGAGCCGATC 

IDRIGPERIAAMIGEPI 
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